Frequency compression and its effects in speech recognition.
نویسندگان
چکیده
BACKGROUND frequency compression. AIM to evaluate the index of speech recognition (IPRF) using frequency compression in three different ratios. METHODS monosyllabic words were recorded using an algorithm of frequency compression in three ratios: 1:1, 2:1, 3:1, generating three lists of words. Eighteen listeners accomplished the IPRF using the modified words. They were subdivided in two groups, considering familiarity with the speech material: group of audiologists (F) and group of patients (P). RESULTS a statistically significant decrease in accuracy was observed when using frequency compression. Group F presented a better performance than Group P in all of the applied ratio frequency compression ratios. CONCLUSION Frequency compression hinders speech recognition; as the compression ratio increases, so does the level of difficulty. Familiarity with the words facilitates recognition in any hearing condition.
منابع مشابه
Effects of ageing on speed and temporal resolution of speech stimuli in older adults
Background: According to previous studies, most of the speech recognition disorders in older adults are the results of deficits in audibility and auditory temporal resolution. In this paper, the effect of ageing on timecompressed speech and auditory temporal resolution by word recognition in continuous and interrupted noise was studied. Methods: A time-compressed speech test (TCST) w...
متن کاملEffect of Speech Compression on the Automatic Recognition of Emotions
This paper investigates the effects of standard speech compression techniques on the accuracy of automatic emotion recognition. Effects of Adaptive Multi-Rates (AMR), Adaptive Multi-Rate Wideband (AMR-WB) and Extended Adaptive Multi-Rate Wideband (AMR-WB+) speech codecs were compared against emotion recognition from uncompressed speech. The recognition methods included techniques based on three...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملLexical tone recognition with spectrally mismatched envelopes.
It has been shown that frequency-place mismatch has detrimental effects on English speech recognition. The present study investigated the effects of mismatched spectral distribution of envelopes on Mandarin Chinese tone recognition using a noise-excited vocoder. In Experiment 1, speech samples were processed to simulate a cochlear implant with various insertion depths. The carrier bands were sh...
متن کاملStatistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language
Setup of an emotion recognition or emotional speech recognition system is directly related to how emotion changes the speech features. In this research, the influence of emotion on the anger and happiness was evaluated and the results were compared with the neutral speech. So the pitch frequency and the first three formant frequencies were used. The experimental results showed that there are lo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pro-fono : revista de atualizacao cientifica
دوره 21 2 شماره
صفحات -
تاریخ انتشار 2009